Model Selection

High-resolution video generation

# High-resolution video generation

Cosmos Predict2 2B Text2Image

Cosmos-Predict2 is a series of high-performance pre-trained world foundation models designed to generate physics-aware images, videos, and world states, which can be used for the development of physics AI.

Wan2.1 T2V 1.3B

Wan 2.1 is a comprehensive open-source video foundation model designed to push the boundaries of video generation, supporting tasks such as text-to-video and image-to-video generation.

Text-to-Video Supports Multiple Languages

Cogvideox1.5 5B I2V

CogVideoX is an open-source video generation model that supports generating videos from images, similar to the Qingying platform.

The first DiT-based video generation model capable of real-time generation of high-quality videos, supporting two scenarios: text-to-video and image + text-to-video.

Text-to-Video English

Cogvideox Fun 5b InP

An improved video generation tool based on the CogVideoX architecture, supporting text/image generation of approximately 6-second, 8fps videos

Text-to-Video English

Cogvideox Fun 2b InP

A video generation model based on the improved CogVideoX architecture, supporting text/image-to-video and multi-resolution generation

Text-to-Video English

The first open-source 1024x576 text-to-video model, fine-tuned from a base model

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase